Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system

ABSTRACT

A method for equalizing a speech signal generated within a pressurized air delivery system, the method including the steps of: generating an inhalation noise model ( 1152 ) based on inhalation noise; receiving an input signal ( 802 ) that includes a speech signal; and equalizing the speech signal ( 1156 ) based on the noise model.

REFERENCE TO RELATED APPLICATIONS

The present invention is related to the following U.S. applicationscommonly owned together with this application by Motorola, Inc.:

-   -   Ser. No. ______, filed Jun. 30, 2004, titled “Method an        Apparatus for Detecting and Attenuating Inhalation Noise in a        Communication System” by Harton, et al. (attorney docket no.        CM06913G); and    -   Ser No. ______, filed Jun. 30, 2004, titled “Method and        Apparatus for Characterizing Inhalation Noise and Calculating        Parameters Based on the Characterization” by Kushner, et al.        (attorney docket no. CM06915G).

FIELD OF THE INVENTION

The present invention relates generally to a pressurized air deliverysystem coupled to a communication system.

BACKGROUND OF THE INVENTION

Good, reliable communications among personnel engaged in hazardousenvironmental activities, such as fire fighting, are essential foraccomplishing their missions while maintaining their own health andsafety. Working conditions may require the use of a pressurized airdelivery system such as, for instance, a Self Contained BreathingApparatus (SCBA) mask and air delivery system, a Self ContainedUnderwater Breathing Apparatus (SCUBA) mask and air delivery system, oran aircraft oxygen mask system. However, even while personnel are usingsuch pressurized air delivery systems, it is desirable that good,reliable communications be maintained and personnel health and safety beeffectively monitored.

FIG. 1 illustrates a simple block diagram of a prior art system 100 thatincludes a pressurized air delivery system 110 coupled to acommunication system 130. The pressurized air delivery system typicallyincludes: a breathing mask 112, such as a SCBA mask; an air cylinder(not shown); a regulator 118; and a high pressure hose 120 connectingthe regulator 118 to the air cylinder. Depending upon the type of airdelivery system 110 being used, the system 110 may provide protection toa user by, for example: providing the user with clean breathing air;keeping harmful toxins from reaching the user's lungs; protecting theuser's lungs from being burned by superheated air inside of a burningstructure; protecting the user's lungs from water; and providingprotection to the user from facial and respiratory burns. Moreover, ingeneral the mask is considered a pressure demand breathing systembecause air is typically only supplied when the mask wearer inhales.

Communication system 130 typically includes a conventional microphone132 that is designed to record the speech of the mask wearer and thatmay be mounted inside the mask, outside and attached to the mask, orheld in the hand over a voicemitter port on the mask 112. Communicationsystem 130 further includes a communication unit 134 such as a two-wayradio that the mask wearer can use to communicate her speech, forexample, to other communication units. The mask microphone device 132may be connected directly to the radio 134 or through an intermediaryelectronic processing device 138. This connection may be through aconventional wire cable (e.g., 136), or could be done wirelessly using aconventional RF, infrared, or ultrasonic short-rangetransmitter/receiver system. The intermediary electronic processingdevice 138 may be implemented, for instance, as a digital signalprocessor and may contain interface electronics, audio amplifiers, andbattery power for the device and for the mask microphone.

There are some shortcomings associated with the use of systems such assystem 100. These limitations will be described, for ease ofillustration, by reference to the block diagram of FIG. 2, whichillustrates the mask-to-radio audio path of system 100 illustrated inFIG. 1. Speech input 210 (e.g., S_(i)(f)) from the lips enters the mask(e.g. a SCBA mask), which has an acoustic transfer function 220 (e.g.,MSK(f)) that is characterized by acoustic resonances and nulls. Theseresonances and nulls are due to the mask cavity volume and reflectionsof the sound from internal mask surfaces. These effects characterized bythe transfer function MSK(f) distort the input speech waveform S_(i)(f)and alter its spectral content. Another sound source is noise 230generated from the breathing equipment (e.g. regulator inhalation noise)that also enters the mask and is affected by MSK(f). Another transferfunction 240 (e.g., NP_(k)(f)) accounts for the fact that the noise isgenerated from a slightly different location in the mask than that ofthe speech. The speech and noise S(f) are converted from acousticalenergy to an electronic signal by a microphone which has its owntransfer function 250 (e.g., MIC(f)). The microphone signal thentypically passes through an audio amplifier and other circuitry, whichalso has a transfer function 260 (e.g., MAA(f)). An output signal 270(e.g., S_(o)(f)) from MAA(f) may then be input into a radio for furtherprocessing and transmission.

Returning to the shortcomings of systems such as system 100, an exampleof such a shortcoming relates to the generation by these systems of loudacoustic noises as part of their operation. More specifically, thesenoises can significantly degrade the quality of communications,especially when used with electronic systems such as radios. One suchnoise that is a prominent audio artifact introduced by a pressurized airdelivery system, like a SCBA system, is regulator inhalation noise,which is illustrated in FIG. 2 as box 230.

The regulator inhalation noise occurs as a broadband noise burstoccurring every time the mask wearer inhales. Negative pressure in themask causes the air regulator valve to open, allowing high-pressure airto enter the mask and producing a loud hissing sound. This noise ispicked up by the mask communications system microphone along withensuing speech, and has about the same energy as the speech. Theinhalation noise generally does not mask the speech since it typicallyoccurs only upon inhalation. However, it can cause problems—examples ofwhich are described as follows. For example, the inhalation noise cantrigger VOX (voice-operated switch) circuits, thereby opening andoccupying radio channels and potentially interfering with other speakerson the same radio channel. Moreover, in communication systems that usedigital radios, the inhalation noise can trigger VAD (Voice ActivityDetector) algorithms causing noise estimate confusion in noisesuppression algorithms farther down the radio signal processing chain.In addition, the inhalation noise is, in general, annoying to alistener.

A second shortcoming of systems such as system 100 is described below.These systems use masks that typically encompass the nose and mouth, orthe entire face. The air system mask forms an enclosed air cavity offixed geometry that exhibits a particular set of acoustic resonances andanti-resonances (nulls) that are a function of mask volume and internalreflective surface geometries, and that alters the spectral propertiesof speech produced within the mask. More specifically, in characterizingthe air mask audio path (FIG. 2), the most challenging part of thesystem is the acoustic transfer function (220) from the speaker's lipsto the mask microphone. These spectral distortions can significantlydegrade the performance of attached speech communication systems,especially systems using parametric digital codecs that are notoptimized to handle corrupted speech. Acoustic mask distortion has beenshown to affect communication system quality and intelligibility,especially when parametric digital codecs are involved. Generally, asidefrom the inhalation noise, the air system effects causing the largestloss of speech quality appear to be due to the poor acoustics of themask.

FIG. 3 illustrates an example of a measured spectral magnitude responseinside the mask (320) and at the mask microphone output (310) and acalculated combined transfer function (330) for the mask, microphone,and microphone amplifier. These particular data were obtained using aSCBA mask mounted on a head and torso simulator. The acoustic excitationconsisted of a 3 Hz-10 KHz swept sine wave driving an artificial mouthsimulator. As FIG. 3 illustrates, the spectrum is significantlyattenuated at frequencies below 500 Hz and above 4.0 KHz, mostly due toa preamp band pass filter in the microphone, and contains a number ofstrong spectral peaks and notches in the significant speech pass bandregion between 50 and 4.0 KHz. These spectral peaks and notches aregenerally caused by reflections inside the mask that cause combfiltering, and by cavity resonance conditions. The significant spectralpeaking and notching modulate the speech pitch components and formantsas they move back and forth through the pass band, resulting in degradedquality and distorted speech. It may be desirable to determine atransfer function or transfer functions characterizing such a systemwith such transfer functions being used to define an equalization systemto reduce speech distortion.

A number of proven techniques exist to adaptively determine a systemtransfer function and equalize a transmission channel. One effectivemethod to determine a system transfer function is to use a broadbandreference signal to excite the system and determine the systemparameters. A problem in estimating the transfer function of many speechtransmission environments is that a suitable broadband excitation signalis not readily available. One common approach is to use the long-termaverage speech spectrum as a reference. However, adaptation time usingthis reference can take a long time, particularly if the speech input issparse. In addition, the long-term speech spectrum can vary considerablyfor and among individuals in public service activities that frequentlyinvolve shouting and emotional stress that can alter the speech spectrumconsiderably.

Another shortcoming associated with systems such as system 100 is thelack of more efficient methods and apparatus for measuring certainparameters of the mask wearer including, for example, biometricparameters. Measurement of such parameters of individuals working inhazardous environments, who may be using systems such as system 100, isimportant for monitoring the safety and performance of thoseindividuals. For example, measurements of the individual's respirationrate and air consumption are important parameters that characterize hiswork-load, physiological fitness, stress level, and consumption of thestored air supply (i.e. available working time). Conventional methods ofmeasuring respiration involve the use of chest impedance plethysmographyor airflow temperature measurements using a thermistor sensor. However,getting reliable measurements, using these conventional methods, fromindividuals working in physically demanding environments such asfirefighting is more difficult due to intense physical movement that cancause displacement of body-mounted sensors and artifacts typically usedto take the measurements.

Thus, there exists a need for methods and apparatus for effectivelydetecting and attenuating inhalation noise, equalizing speech (i.e.,removing distortion effects), and measuring parameters associated withusers in a system that includes a pressurized air delivery systemcoupled to a communication system.

BRIEF DESCRIPTION OF THE FIGURES

A preferred embodiment of the invention is now described, by way ofexample only, with reference to the accompanying figures in which:

FIG. 1 illustrates a simple block diagram of a prior art system thatincludes a pressurized air delivery system for breathing coupled to acommunication system;

FIG. 2 illustrates the mask-to-radio audio path of the systemillustrated in FIG. 1;

FIG. 3 illustrates an example of a measured spectral magnitude responseinside a mask and at the mask microphone output and a calculatedcombined transfer function for the mask, microphone, and microphoneamplifier;

FIG. 4 illustrates an example of an inhalation noise generated by a SCBAair regulator;

FIG. 5 illustrates the long-term magnitude spectrum of the inhalationnoise illustrated in FIG. 4;

FIG. 6 illustrates four overlapping spectra of inhalation noisesgenerated by a single speaker wearing a given SCBA mask;

FIG. 7 illustrates audio output from a SCBA microphone showinginhalation noise bursts intermingled with speech;

FIG. 8 illustrates a simple block diagram of a method for detecting andeliminating inhalation noise in accordance with one embodiment of thepresent invention;

FIG. 9 illustrates a simple block diagram of one embodiment of aspectral matcher used in the method of FIG. 8;

FIG. 10 illustrates a simple block diagram of another embodiment of aspectral matcher used in the method of FIG. 8;

FIG. 11 illustrates a simple block diagram of a method for equalizing aspeech signal in accordance with another embodiment of the presentinvention;

FIG. 12 illustrates an inhalation noise spectrum before equalization ascompared to the spectra after 14^(th) order and 20^(th) order LPCinverse filter equalization in accordance with the present invention;

FIG. 13 illustrates a simple block diagram of a method for determiningthe duration of frequency of inhalation noise and determiningrespiration rate and air usage volume in accordance with anotherembodiment of the present invention for use in measuring biometricparameters;

FIG. 14 illustrates a signal from a microphone input that containsspeech and air regulation inhalation noise;

FIG. 15 illustrates the average normalized model error of the signalillustrated in FIG. 14 as determined by the method illustrated in FIG.13;

FIG. 16 illustrates the inhalation noise detector output signal asgenerated by the method illustrated in FIG. 13; and

FIG. 17 illustrates the integrated inhalation detector output asgenerated by the method illustrated in FIG. 13.

DETAILED DESCRIPTION OF THE INVENTION

While this invention is representative of embodiments in many differentforms, there are shown in the figures and will herein be described indetail specific embodiments, with the understanding that the presentdisclosure is to be considered as an example of the principles of theinvention and not intended to limit the invention to the specificembodiments shown and described. Further, the terms and words usedherein are not to be considered limiting, but rather merely descriptive.It will also be appreciated that for simplicity and clarity ofillustration, elements shown in the figures have not necessarily beendrawn to scale. For example, the dimensions of some of the elements areexaggerated relative to each other. Further, where consideredappropriate, reference numerals have been repeated among the figures toindicate corresponding elements.

Before describing in detail the various aspects of the presentinvention, it would be useful in the understanding of the invention toprovide a more detailed description of the air regulator inhalationnoise that was briefly described above. Inhalation noise is a result ofhigh-pressure air entering a SCBA or other pressurized air deliverysystem mask when a person inhales and the regulator valve opens.Turbulence at the valve creates a very loud, broadband hissing noise,directly coupled into the SCBA mask, which is comparable in amplitude atthe microphone with the speech signal. An example of a typicalinhalation noise 400 recorded inside of a SCBA mask and its wide-bandspectrogram 500 are shown, respectively, in FIGS. 4 and 5.

As can be seen in FIG. 5, the noise spectrum is broadband with prominentspectral peaks occurring at approximately 500, 1700, 2700, and 6000 Hz.The peaks are due to resonances within the mask and comb filtering dueto internal mask reflections, and may vary in frequency and magnitudewith different mask models, sizes, and configurations. The coloration ofthe noise spectrum is typically stationary for a particular mask/wearercombination since the gross internal geometry is essentially constantonce the mask is placed on the face. This is demonstrated in FIG. 6where the spectra of three separate inhalation noises (610, 620 and 630)from a SCBA mask microphone, for the same speaker wearing a given SCBAmask, are shown superimposed. This consistency has also been observedfor different speakers and for masks from different manufacturers.Moreover, high spectral similarity of the air regulator noise fromdifferent speakers wearing the same mask was also observed.

Finally, FIG. 7, illustrates an example of speech 710 recorded from aSCBA system. As FIG. 7 demonstrates, the effects of inhalation noise 720are not on the speech itself, since people do not normally try to speakwhile inhaling. However, the noise is of sufficient energy and spectrumto cause problems with speech detector and noise suppression circuitryin radios and to present a listening annoyance.

In a first aspect of the present invention is a method and apparatus fordetecting and eliminating inhalation noise in a pressurized air deliverysystem coupled to a communication system, such as a system 100illustrated in FIG. 1. The method in accordance with this embodiment ofthe present invention is also referred to herein as the ARINA (AirRegulator Inhalation Noise Attenuator) method. The basis of the ARINAmethod for identifying and eliminating air regulator inhalation noise isthe relative stationarity of the noise as compared to speech and ascompared to other types of noise such as, for instance, variousenvironmental noises. A block diagram of the ARINA method 800 is shownin FIG. 8 and can be divided into four sections: Noise Model Matching810, Noise Detection 830, Noise Attenuation 850, and Noise ModelUpdating 870.

The basic methodology of the ARINA method 800 can be summarized asfollows. Method 800 models the inhalation noise preferably using adigital filter (e.g. an all pole linear predictive coding (LPC) digitalfilter). Method 800 then filters the audio input signal (i.e., speechand noise picked up by the mask microphone) using an inverse of thenoise model filter and compares the energy of the output of the inversenoise model filter with that of the input signal or other energyreference. During the signal periods in which a close spectral matchoccurs between the input signal and the model, the regulator inhalationnoise comprising the input signal may be attenuated to any desiredlevel.

Turning to the specifics of the ARINA method 800 as illustrated in FIG.8, the first step in the processing is to detect the occurrence of theinhalation noise by continuously comparing an input signal 802 against areference noise model via the Noise Model Matching section 810 of method800, which may in the preferred embodiment be implemented in accordancewith FIG. 9 or FIG. 10 depending on the complexity of implementationthat can be tolerated. However, those of ordinary skill in the art willrealize that alternative spectral matching methods may be used. The twopreferred matching methods indicated above as illustrated in FIG. 9 andFIG. 10 are referred to herein as the Normalized Model Error (or NME)method and the Itakura-Saito (or I-S) distortion method. In bothmethods, the reference noise model is represented by a digital filter(912, 1012) that approximates the spectral characteristics of theinhalation noise. In the preferred embodiment, this model is representedas an all-pole (autoregressive) filter specified by a set of LPCcoefficients. However, those of ordinary skill in the art will realizethat alternate filter models may be used in place of the all-pole modelsuch as, for instance, a known ARMA (autoregressive moving average)model.

The reference noise model filter coefficients are obtained from a set ofautocorrelation coefficients derived from at least one digitized sampleof the inhalation noise. An initial noise sample and correspondinginitial autocorrelation coefficients (872) may be obtained off-line fromany number of noise pre-recordings and is not critical to theimplementation of the present invention. Moreover, experiments haveshown that the initial noise sample from one SCBA mask, for example,also works well for other masks of the same design and in some cases formasks of different designs. The autocorrelation coefficients can becalculated directly from raw sampled noise data, or derived from othercommonly used spectral parameter representations such as LPC orreflection coefficients, using common methods well known to thoseskilled in the art.

In the preferred embodiment, the noise model autocorrelationcoefficients are calculated according to the following standard formula:$\begin{matrix}{{R_{i} = {{\sum\limits_{n = 1}^{N - i}{x_{n}x_{n + i}\quad i}} = 0}},1,2,\ldots\quad,p,{p{N}}} & {{EQ}\text{-}1}\end{matrix}$where R_(i) is the ith coefficient of a maximum of p autocorrelationcoefficients, x_(n) is the nth sample of a typical inhalation noisesignal sample segment in which there are a maximum of N samples, and R₀represents the energy of the entire segment. The order of theautocorrelation function, p, is typically between 10 and 20 with thevalue for the preferred embodiment being 14. Moreover, ideally the Nsignal samples are windowed using a Hamming window before theautocorrelation is performed to smooth the spectral estimate. TheHamming window is described by:w(n)=0.54−0.46 cos(2πn/N), n=0, 1, 2, . . . , N−1.  EQ-2Those of ordinary skill in the art will realize that other windowingmethods may also be used.

The noise model autocorrelation coefficients are next used to determinea set of 10^(th) order noise model LPC coefficients, a₁, a₂ . . . ,a_(p), representing an all-pole linear predictive model filter with az-domain representation transfer function of: $\begin{matrix}{{{H(z)} = \frac{1}{1 + {a_{1}z^{- 1}} + {a_{2}z^{- 2}} + \ldots\quad + {a_{p}z^{- p}}}},} & {{EQ}\text{-}3}\end{matrix}$where z=e^(−jnωT) is the z-transform variable. In this example 10^(th)order LPC coefficients were determined. However, a different order ofLPC coefficients may be selected based on the particular implementation.The autocorrelation-to-LPC parameter transformation (step 912, 1012) maybe done using any number of parameter transformation techniques known tothose skilled in the art. In the preferred embodiment, the LPCparameters are derived from the autocorrelation parameters using theDurbin method well known to those skilled in the art.

Turning now to the specifics of the NME spectral matching methodillustrated in FIG. 9, the derived all-pole LPC noise filter model isinverted to form an inverse LPC filter (step 914):Ĥ(z)=1+a ₁ z ⁻¹ +a ₂ z ⁻² + . . . +a _(p) z ^(−p).  EQ-4Ideally a low-pass filtered and sampled audio input signal 802 obtainedfrom the mask microphone and containing speech and inhalation noise,S(z), is passed through the inverse filter Ĥ(z) (step 914) to obtain anoutput signal,Y(z)=S(z)Ĥ(z).  EQ-5The energies, E_(in), E_(out), of the inverse filter input and outputsignals are then calculated (respectively at steps 918 and 916) and adistortion measure D is calculated at step 920 and functions as asimilarity measure between the noise model and the input signal. Thetheoretical lower bound on D is zero for an infinite order, but inpractice, the lower bound will be determined by the input signal and howwell it matches the noise model of finite order. In this implementation,the distortion measure is defined by a ratio of E_(out) to E_(in),referred to as the normalized model error (NME), calculated at step 920as: $\begin{matrix}{D = {{NME} = {\frac{E_{out}}{E_{in}} = {\frac{{{Y(z)}}^{2}}{{{S(z)}}^{2}}.}}}} & {{EQ}\text{-}6}\end{matrix}$The energy of the input signal may then be removed in accordance to howwell it matches the noise model. In the preferred embodiment, the abovedescribed signal filtering is done via convolution in the time domainalthough it could also be done in the frequency domain as indicated inthe preceding equations.

The signal processing for the ARINA method 800 is generally done on asegmented frame basis. In the preferred embodiment, the input signal 802is low-pass filtered, sampled at 8.0 KHz, buffered into blocks of 80samples (10 msec), and passed through the inverse noise model filter(EQ-5). Thus, all filtering is ideally done on consecutive, 80 samplesegments of the input signal 802. The normalized model error (NME) ofthe inverse noise model filter is then calculated by dividing the filteroutput frame energy by the input signal frame energy (EQ-6). Thiscalculation, however, is ideally done on a sub-frame basis for bettertime resolution. Thus, each 80-point frame is divided into sub-frames,for example 4, 20-point sub-frames, although alternative sub-framedivisions may be used depending on the degree of accuracy required. Theoverall normalized model error signal (NME) may then be smoothed byaveraging the output filter energy E_(out) of the last 16 sub-frames anddividing that quantity by the average of the corresponding time-aligned16 sub-frame input filter energies E_(in). This does not add any delayto the analysis but helps remove transient dropouts and the effects ofother loud background noises that may alter the regulator noisespectrum. The average NME value is thereby used, in this implementationof the present invention, as a measure of the noise model to inputsignal spectral similarity.

In the preferred embodiment, the second, more complex but more accuratenoise model matching method 810 as illustrated in FIG. 10 is amodification of the Itakura-Saito distortion method. The I-S method ofdetermining the spectral similarity between two signals is well known bythose skilled in the art. In this method the residual noise modelinverse filter energy is compared with the residual energy of the“optimal” signal filter instead of with the input signal energy as inthe previously described NME method. The filter is “optimal” in thesense that it best matches the spectrum of the current signal segment.

The residual energy corresponding to the optimally filtered signal iscalculated using steps 1018-1024. In the I-S method at step 1018,ideally two consecutive 80 sample buffers of the input signal 802 arecombined into a single 160 sample segment. The 160 sample segment iswindowed preferably using a 160 point Hamming window given by:w(n)=0.54−0.46 cos(2πn/160), n=0, 1, 2, . . . , 159.  EQ-7The windowed signal data is then autocorrelated using the methoddescribed in EQ-1. These autocorrelation coefficients generated in step1018 are designated as {circumflex over (R)}_(i), i=0, 1, 2, . . . , p.A corresponding set of LPC coefficients is derived from theautocorrelation coefficients preferably using the Durbin algorithm instep 1020 in the same manner as used for generating the reference noisemodel parameters in step 1012. The signal model LPC coefficientsgenerated in step 1020 are designated as â_(i) i=1, 2, . . . , p. Instep 1022, these LPC coefficients (step 1020) are autocorrelatedaccording to EQ-9 below yielding {circumflex over (b)}_(i). Using theseparameters, the residual energy of the signal, E_(s), passing throughthis filter is calculated at step 1024 as: $\begin{matrix}{{E_{s} = {{{\hat{b}}_{0}{\hat{R}}_{0}} + {2{\sum\limits_{i = 1}^{p}{{\hat{b}}_{i}{\hat{R}}_{i}}}}}},} & {{EQ}\text{-}8} \\{{{\hat{b}}_{i} = {\sum\limits_{j = 0}^{p - i}{{\hat{a}}_{j}{\hat{a}}_{j + i}}}},{0 \leq i \leq p},{{\hat{a}}_{0} = 1.}} & {{EQ}\text{-}9}\end{matrix}$The energy of the input signal passing through the noise model iscalculated using steps 1012-1016. At step 1012 the noise model LPCcoefficients are calculated from the noise model autocorrelationcoefficients (874) as described above. These LPC coefficients generatedat step 1012 are designated as a_(i) i=1, 2, . . . , p. At step 1014,the LPC coefficients (from step 1012) are autocorrelated according toEQ-11 below yielding b_(i). Using these parameters and theautocorrelation sequence calculated at step 1018, {circumflex over(R)}_(i), the energy of the signal passing through the reference noisemodel is calculated at step 1016 as given by EQ-10: $\begin{matrix}{{E_{m} = {{b_{0}{\hat{R}}_{0}} + {2{\sum\limits_{i = 1}^{p}{b_{i}{\hat{R}}_{i}}}}}},} & {{EQ}\text{-}10} \\{{b_{i} = {\sum\limits_{j = 0}^{p - i}{a_{j}a_{j + i}}}},{0 \leq i \leq p},{a_{0} = 1.}} & {{EQ}\text{-}11}\end{matrix}$A measure of the spectral distortion, D, of the “optimal” signal modelto the reference noise model is calculated at step 1028 as defined as:$\begin{matrix}{D = {\frac{E_{m}}{E_{s}}.}} & {{EQ}\text{-}12}\end{matrix}$The more similar the signal model is to the reference noise model thecloser the distortion measure is to 1.0 which is the lower bound. Thisdistortion measure is used by the Noise Detection section 830 of theARINA method 800 to determine the presence of inhalation noise. The I-Sdistortion measure is calculated using 160 samples in the preferredembodiment. The inhalation noise classification as determined by the I-Sdistortion measure is associated with each 80 sample frame of the 160sample segment. Moreover, steps 1012 and 1014 need only be performed togenerate an initial noise model (e.g., based on initial autocorrelationcoefficients 872) or to update the noise model in accordance with theNoise Model Updating section 870 referred to above and described indetail below.

In the Noise Detection portion 830 of the ARINA method 800, the valuederived from the spectral match 810 (i.e. the NME or the I-S distortionmeasure which represents the similarity measure between the input signaland the noise model) is then compared (step 832) to an empiricallyderived threshold value (e.g., D_(min1)). This detection threshold isselected to detect the presence of inhalation noise while notmisclassifying speech or other types of noise as inhalation noise.

Moreover, depending on the specificity of the noise filter model, thespectral variations of the inhalation noise, and the similarity of somespeech sounds to the noise model, for instance, false detections canoccur. Therefore, since the duration of a true air regulator inhalationnoise is fairly long compared to the speech artifacts, a noise durationthreshold test is ideally also applied (step 834). Thus, the detectionthreshold must be met for a predetermined number of consecutive frames“K₁” (e.g. 4 frames) before detection is validated. Relative signalenergy, waveform zero-crossings, and other feature parameter informationmay be included in the detection scheme to improve speech/inhalationnoise discrimination. Thus if both threshold criteria are met (fromsteps 832 and 834), the spectral match is deemed acceptably close and aninhalation noise is assumed currently present.

In the Noise Attenuation portion 850 of the ARINA method 800, the outputof the Noise Detection portion 830 is used to gate an output signalmultiplier (852) through which the input signal 802 is passed. If theinhalation noise was detected, the multiplier gain G is set at step 854to some desired attenuation value “G_(min)”. This attenuation gain valuemay be 0.0 to completely eliminate the noise or may be set to a highervalue to not completely eliminate the inhalation noise but to suppressit. Total suppression may not be desired to assure a listener that theair regulator is functioning. In the preferred embodiment G_(min) has avalue of 0.05. Otherwise if inhalation noise is not detected, the gain Gis ideally set to 1.0 such as not to attenuate the speech signal.Variations of this gating/multiplying scheme can be employed. Forexample variations may be employed that would enable that the attack anddecay of the gating to be less abrupt, reducing the possibility ofattenuating speech that may occur directly before or after an inhalationnoise, thereby improving the perceived quality of the speech. Moreoveras can be readily seen from method 800, an important benefit of thisinvention is that the original signal is not altered except whenregulator noise is detected, unlike conventional, continuous noisefiltering methods.

An important component of the ARINA method 800 is the ability toperiodically update the noise model for detection purposes. For example,over time, movement of the air mask on the face may cause changes in itseffects on the acoustic transfer function. Also, an air mask worn bydifferent people or the use of different masks will mean that thespectrum of initial reference noise model may deviate from the actualinhalation noise spectrum. By periodically updating the originalreference noise model, an accurate current reference noise model can bemaintained. Accordingly, the Noise Model Updating Section 870 of theARINA method 800 is used to update the noise model.

The Noise Model Updating section 870 uses the output of the NoiseDetection section 830 to determine when the reference LPC filter modelof the regulator inhalation noise should be updated. For example, theoutput from the Noise Detection section 830 may be compared to a secondempirically determined threshold value (e.g., D_(min2)) at step 876 todetermine whether to update the noise model. When the threshold is met,a number of consecutive sub-frames detected as inhalation noise may becounted (step 878), and the signal samples in each sub-frame stored in abuffer. When the number of consecutive noise sub-frames exceeds athreshold number “K₂” (e.g., 8 sub-frames, 160 samples in the preferredembodiment) a decision is made to update the noise model at step 880. Ifa non-noise sub-frame is detected (e.g., at any of steps 832, 834 and876), the noise frame count is reset to zero at step 884, and the noiseframe count is updated at step 878. The autocorrelation coefficients forthe “K₂” consecutive signal sub-frames representing the currentlydetected inhalation noise may then be calculated at step 882 using thepreviously stated formulas EQ-1 and EQ-2.

These new autocorrelation coefficients are used to update the noisemodel autocorrelation coefficients at step 874. Ideally theautocorrelation coefficients calculated at step 882 are averaged withthe previous noise model autocorrelation coefficients at step 874 usinga simple weighting formula such as, for instance:R _(i) ^(REF) =αR _(i) ^(REF)+(1−α)R _(i) ^(NEW),  EQ-13where R_(i) ^(REF) are the autocorrelation coefficients of the currentreference noise model, R_(i) ^(NEW) are the autocorrelation coefficientsof the currently detected inhalation noise sample, and α is a weightingfactor between 1.0 and 0.0 that determines how fast the initialreference model is updated. This weighting factor can be adjusteddepending on how fast the spectral characteristics of the inhalationnoise change, which as noted previously, is usually slow. A new set ofLPC coefficients for the noise model inverse filter is then recalculatedfrom the updated model autocorrelations at steps 912 and 1012.Constraints can be placed on the adjustment to the noise model so thatlarge deviations from the noise model cannot occur due to falsedetections. In addition, the initial reference noise model coefficients(872) are stored so that the system can be reset to the initial modelstate if necessary. The adaptation capability of method 800 describedabove by reference to the Noise Model Updating section 870 enables thesystem to adapt to the characteristics of a particular mask andregulator and enables optimal detection performance.

Advantages of the ARINA method 800 include that the speech signal itselfis not irreversibly affected by the processing algorithm, as is the casein algorithms employing conventional continuous filtering. An additionaladvantage is that the LPC modeling used here is simple, easily adaptablein real-time, is straightforward, and computationally efficient. Thoseof ordinary skill in the art will realize that the above advantages werenot meant to encompass all of the advantages associated with the ARINAembodiment of the present invention but only meant to serve as beingrepresentative thereof.

In a second aspect of the present invention is a method and apparatusfor equalizing a speech signal in a pressurized air delivery systemcoupled to a communication system, such as a system 100 illustrated inFIG. 1. The method in accordance with this embodiment of the presentinvention is also referred to herein as the AMSE (Air Mask SpeechEqualizer) method. The basis of the AMSE method for equalization is therelative stationarity of the noise as compared to speech and as comparedto other types of noise such as, for instance, various environmentalnoises. Since the same mask resonance conditions affect both theregulator noise and a speech signal, equalizing for the noise shouldalso yield an equalizer appropriate for equalizing the speech signal,although peaks and nulls due to sound reflections will be slightlydifferent between the noise and the speech due to source locationdifferences between the speech and the noise.

The AMSE method uses the broadband air regulator inhalation noise,present in all mask-type pressurized air breathing systems (e.g. anSCBA), to estimate the acoustic resonance spectral peaks and nulls (i.e.spectral magnitude acoustic transfer function) produced by the maskcavity and structures. This spectral knowledge is then used to constructa compensating digital inverse filter in real time, which is applied toequalize the spectrally distorted speech signal and produce an outputsignal approximating the undistorted speech that would be producedwithout the mask. This action improves the quality of the audio obtainedfrom the mask microphone and can result in improved communicationsintelligibility.

Turning to the specifics of the AMSE method, a block diagram of themethod 1100 is shown in FIG. 11 and can be divided into four sections:Noise Model Matching 1110, Noise Detection 1130, Mask SpeechEqualization 1150, and Noise Model Updating 1170. The Noise ModelMatching, Noise Detection and Noise Model Updating sections of the AMSEmethod are ideally identical to the corresponding sections of the ARINAmethod that were described above in detail. Therefore, for the sake ofbrevity, a detailed description of these three sections will not berepeated here. However, following is a detailed description of the MaskSpeech Equalization section 1150 (within the dashed area) of the AMSEmethod 1100.

Using the Speech Equalization Section 1150 of the AMSE method 1100, theinhalation noise reference autocorrelation coefficients are used togenerate an nth order LPC model of the noise at step 1152 using EQ-3above. The LPC model generated in step 1152 characterizes the transferfunction of the mask, e.g., MSK(f) in FIG. 2, and for the inhalationnoise also includes the noise path transfer function NP(f). Preferably a14^(th) order model is suitable but any order can be used. Those ofordinary skill in the art will realize that alternate filter models maybe used in place of the all-pole model such as, for instance, a knownARMA (autoregressive moving average) model. Moreover, the filteringoperations may be implemented in the frequency domain as opposed to thetime domain filtering operations described above with respect to thepreferred embodiment of the present invention.

The LPC model coefficients are then preferably used in an inverse filter(in accordance with EQ-4) through which the speech signal is passed atstep 1156. Passing the speech signal through the inverse filtereffectively equalizes the input signal, thereby removing the spectraldistortions (peaks and notches) caused by the mask transfer functionMSK(f) in FIG. 2. Post filtering at step 1158 using a suitable fixedpost-filter is ideally performed on the equalized signal to correct forany non-whiteness of the inhalation noise, or to give the speech signala specified tonal quality to optimally match the requirements of afollowing specific codec or radio. This post-filtering may also be usedto compensate for the noise path transfer function NP(f) in FIG. 2.

The effect of the equalizer of the AMSE method 800 on air regulatornoise is shown in FIG. 12 for two different order equalization filters.Specifically, FIG. 12 illustrates a spectral representation 1210 of aninhalation noise burst before equalization. Further illustrated are thespectra of the inhalation noise after equalization using a 14^(th) orderequalization filter (1220) and a 20^(th) order equalization filter(1230). As can be seen, the spectral peaking is flattened extremely wellby the 20^(th) order equalization filter and reasonable well using the14^(th) order equalization filter. Moreover, listening tests on maskspeech equalized by these filters showed that the quality of speech wassignificantly improved by use of the equalization filters as compared tothe un-equalized speech. In addition, little difference in perceivedquality of the speech was found between the two filter orders.

Advantages of the AMSE algorithm approach include: 1) it uses a regular,spectrally stable, broadband regulator noise inherent in an air-masksystem as an excitation source for determining mask acoustic resonanceproperties; 2) system transfer function modeling is accomplished inreal-time using simple, well established, efficient techniques; 3)equalization is accomplished in real-time using the same efficienttechniques; and 4) the system transfer function model is continuouslyadaptable to changing conditions in real time. Those of ordinary skillin the art will realize that the above advantages were not meant toencompass all of the advantages associated with the AMSE embodiment ofthe present invention but only meant to serve as being representativethereof.

In a third aspect of the present invention is a method and apparatus fordetermining the duration and frequency of inhalation noise anddetermining respiration rate and air usage volume in a pressurized airdelivery system coupled to a communication system, such as a system 100illustrated in FIG. 1. The method in accordance with this embodiment ofthe present invention is also referred to herein as the INRRA(Inhalation Noise Respirator Rate Analyzer) method. The INRRA method isessentially an indirect way of measuring respiration by monitoring thesound produced by the air regulator instead of measuring breathingsounds from a person. The basis of the INRRA method is that apressurized air breathing system such as an SCBA has one-way airflow.Air can enter the system only from the air source and regulator, andexit only through an exhaust valve. The intake and exhaust valves cannotbe open at the same time. Thus, regulator intake valve action isdirectly related to the user's respiration cycle.

One indicator of the opening of the regulator intake valve is theregulator inhalation noise. Inhalation noise is a result ofhigher-pressure air entering an SCBA or other pressurized air deliverysystem mask. The mask is airtight so when a person inhales it produces aslight negative pressure within the mask that causes the regulator valveto open and pressurized tank air to enter. Air turbulence across thevalve creates a loud, broadband hissing noise that is directly coupledinto the SCBA mask, can be picked up by a microphone, and occurs forevery inhalation. As explained previously, the noise is abrupt and has avery constant amplitude over the duration of the inhalation, providingvery good start and end time resolution. For a given mask type andwearer, the spectral characteristics of the inhalation noise are verystable, as opposed to direct human breath sounds which vary considerablybased on factors such as the size of the mouth opening, vocal tractcondition, and lung airflow. INRRA capitalizes on the stability of theair regulator inhalation noise as a measure of respiratory rate.

INRRA uses a matched filtering scheme to identify the presence of aninhalation noise by its entire spectral characteristic. In addition,INRRA is capable of adapting to changes in the spectral characteristicsof the noise should they occur, thus providing optimal differentiationbetween the inhalation noise and other sounds. By calculating the startof each inhalation, the instantaneous respiration rate and it's timeaverage can be easily calculated from the inhalation noise occurances.In addition, by measuring the end and calculating the duration of eachinhalation noise, and providing some information about the predictablemask regulator flow rate, the system can provide an estimate of theairflow volume. This may be accomplished using only the signal from themicrophone recording the inhalation noise.

A block diagram of the INRRA method 1300 is shown in FIG. 13 and can bedivided into five sections: Noise Model Matching 1310, Noise Detection1330, Inhalation Breath Definer 1350, Parameter Estimator 1370 and NoiseModel Updating 1390. The Noise Model Matching, Noise Detection and NoiseModel Updating sections of the INRRA method are ideally identical to thecorresponding sections of the ARINA method that were described above indetail. Therefore, for the sake of brevity, a detailed description ofthese three sections will not be repeated here. However, following is adetailed description of the Inhalation Breath Definer 1350 and ParameterEstimator 1370 sections of the INRRA method 1300.

First, the Inhalation Breath Definer 1350 will be described. The purposeof section 1350 of the INRRA method 1300 is to characterize theinhalation noise based on at least one factor, for example, in this casebased on a set of endpoints and a duration for one or more completeinhalation noise bursts which correspond with inhalation breaths. Thedecision from the Inhalation Noise Detection section 1330 is used togenerate a preferably binary signal, INM_(m), m=0, 1, 2, . . . , M−1, instep 1352 that represents the presence or absence of inhalation noise asa function of time index m using values of ones and zeros. This binarysignal is stored in a rotating buffer of length M samples, M being largeenough to store enough samples of the binary signal to encompass thetime period of at least two inhalation noise bursts, or breaths at theslowest expected breathing rate. In the preferred embodiment, thisamounts to about 15 seconds. The time resolution of this binary signaland the value of M will be determined by the smallest sub-frame timeused in the Inhalation Noise Detection section 1330, describedpreviously, which depends on the Inhalation Noise Model Matchingsection, and is either 20 samples (2.5 msec) or 80 samples (10 msec),depending on which spectral matching method is used in step 1310.

Since the inhalation noise detector output from 1330 will not always beperfect, detection mistakes may occur during the detection of aninhalation noise causing some ambiguity as to the true start, andduration times of the noise. Thus, the binary inhalation noise signalgenerated by step 1352 is integrated using a well known moving-averagetype or other suitable filter at step 1354. This filter smoothes out anyshort duration detection mistakes and produces a more accurate signalthat defines complete inhalation noise bursts, which correspond withrespiratory breaths. From this signal generated at step 1354, at leastone factor including accurate start time, S_(i), end time, E_(i), andbreath duration time, D_(i), for each noise burst may be determinedwithin processing frame duration accuracy at step 1356. The start andend times of the inhalation noise bursts as represented by the binarysignal INM_(m), are obtained by noting their relative indices within thesignal buffer. The duration D_(i) is defined for a single inhalationnoise burst as:D _(i) =E _(i) −S _(i), i=0, 1, 2, . . . , I_(T),  EQ-14where i designates the ith of I_(T) inhalation noise bursts present inthe binary signal buffer of length M and time period T seconds. Theseinhalation noise burst factor values are ideally stored in a rotating,finite length buffer, one set of parameters per noise burst/breath. Someresults of SCBA mask microphone speech processed by the INRRA algorithmsections 1310, 1330, 1352, and 1354 are shown in FIGS. 14-17, which arebased on speech from a male speaker wearing an SCBA and recorded in aquiet room. FIG. 14 shows the input speech 1420 intermingled with noisebursts 1410. FIG. 15 shows a time-amplitude representation 1500 of thespectral distortion measure D output of Inhalation Noise Model Matchingsection 1310. FIG. 16 shows a time-amplitude representation 1600 of thebinary output of the inhalation noise detector, 1330. FIG. 17 shows atime-amplitude representation 1700 of the output of the moving averagefilter component, 1354, of the breath definer algorithm 1350 thatintegrates the raw detector output and accurately defines the durationof each inhalation.

The Parameter Estimator 1370 section describes examples of parametersthat may be estimated based on the characterization factors of theinhalation noise by the Inhalation Breath Definer section 1350. Two suchexamples of parameters that may be determined are the respiration rateof the user and the approximate inhalation air flow volume. Respirationrate may be easily determined using the sequential start timeinformation, S_(i), of successive inhalation noise bursts that may bedetermined in the Inhalation Breath Definer Section. For example, the“instantaneous” respiration rate per minute may be calculated as:$\begin{matrix}{{{IRR} = \frac{60}{\left( {S_{i} - S_{i - 1}} \right)}},} & {{EQ}\text{-}15}\end{matrix}$where the S_(i) are two successive noise bursts (inhalation breaths)start times in seconds. An average respiration rate may accordingly becalculated as: $\begin{matrix}{{{RR} = \frac{60I_{T}}{\sum\limits_{i = 1}^{I_{T}}\left( {S_{i} - S_{i - 1}} \right)}},} & {{EQ}\text{-}16}\end{matrix}$where I_(T) is the number of detected consecutive breaths (inhalationnoise bursts) in a specified time period T.

The approximate airflow volume during an inhalation breath may beestimated from the duration of the breath that may be determined by theInhalation Breath Definer section, and from some additional informationconcerning the initial air tank fill pressure and the regulator averageflow rate that may be determined off-line, for instance. When the intakevalve is open, the air regulator admits a volume of air at nearlyconstant pressure to the facemask (a function of the ambient air/waterpressure) as long as the air supply tank pressure remains above theminimal input pressure level for the air regulator. Moreover, theairflow rate into the mask is approximately constant while the maskregulator intake valve is open. The amount of air removed from the tanksupply and delivered to the breather is thus proportional to the timethat the intake valve is open. The time that the valve is open can bemeasured by the duration of each inhalation noise.

The initial quantity of air in the supply tank when filled is a functionof the tank volume V₀, the fill pressure P₀, the gas temperature T₀, andthe universal gas constant R, the mass of the gas in moles N_(m), andcan be calculated from the well-known ideal gas equation, PV=N_(m)RT.Since the initial fill pressure and tank cylinder volume may be known,and assuming the temperature of the tank gas and mask gas are the same,the volume of air available for breathing at the mask pressure may begiven as: $\begin{matrix}{V_{M} = {\frac{P_{0}V_{0}}{P_{M}}.}} & {{EQ}\text{-}17}\end{matrix}$The approximate volume of air delivered to the user during inhalationevent i is then:IV _(i) ≈K _(R) D _(i),  EQ-18where IV_(i) is the air volume, D_(i) is the duration of the inhalationevent as determined from the inhalation noise, and K_(R) is acalibration factor related to the airflow rate for a particular airregulator. K_(R) could be derived empirically for an individual systemor perhaps determined from manufacturer's data. From the individualinhalation volumes, IV_(i), the approximate total amount of air used upto a time T, V_(T), may be defined as: $\begin{matrix}{{V_{T} \approx {\sum\limits_{i = 1}^{I_{T}}{IV}_{i}}},} & {{EQ}\text{-}19}\end{matrix}$where I_(T) is the total number of inhalations up to a time T. Theremaining tank supply air is accordingly:V _(R) ≈V _(M) −V _(T).  EQ-20

Some advantages of the INRRA method include that any microphone signalthat picks up the breath noise over a minimal speech bandwidth can beused, and no special sensors are needed. Another advantage is that therespiration detector is based on detecting the noise produced by the airregulator which has stable spectral characteristics, and not humanbreath noises which are variable in character. Yet another advantage isthat the respiration detector is not locked to examining specificfrequencies as are other types of acoustic breath analyzers. Moreover,the system adapts automatically to changes in environment and todifferent users and pressurized air respirator mask systems. Thus, theINRRA method can provide continuously, instantaneous or averagerespiration rate and approximate air use volume data, which is valuableinformation that can be automatically sent outside of system 100, forexample, via a radio data channel to a monitor. Those of ordinary skillin the art will realize that the above advantages were not meant toencompass all of the advantages associated with the INRRA embodiment ofthe present invention but only meant to serve as being representativethereof.

All three methods in accordance with the present invention (ARINA, AMSEand INRRA) are preferably implemented as software algorithms stored on amemory device (that would be included in a system in accordance withsystem 100 described above) and the steps of which implemented in asuitable processing device such as, for instance DSP 138 of system 100.The algorithms corresponding to the autocorrelation and LPC filteringmethods of the present invention would likely take up the majority ofthe processor time. However, these algorithms or the entirety of thealgorithms corresponding to the ARINA, AMSE and INRRA methods may,alternatively, be efficiently implemented in a small hardware footprint.Moreover, since the AMSE method uses many of the methodologies as theARINA method, in another embodiment of the present invention, they maybe efficiently combined.

While the invention has been described in conjunction with specificembodiments thereof, additional advantages and modifications willreadily occur to those skilled in the art. The invention, in its broaderaspects, is therefore not limited to the specific details,representative apparatus, and illustrative examples shown and described.Various alterations, modifications and variations will be apparent tothose skilled in the art in light of the foregoing description. Forexample, although a method for identifying and attenuating inhalationnoise was described above, the methodologies presented with respect tothe present invention may be applied to other types of noise, such asexhalation noise or other types of noises with pseudo-stationaryspectral characteristics lending themselves to efficient detection usingthe above methods. Thus, it should be understood that the invention isnot limited by the foregoing description, but embraces all suchalterations, modifications and variations in accordance with the spiritand scope of the appended claims.

1. A method for equalizing a speech signal generated within apressurized air delivery system, the method comprising the steps of:generating an inhalation noise model based on inhalation noise;receiving an input signal that includes a speech signal; and equalizingthe speech signal based on the noise model.
 2. The method of claim 1,wherein the breathing apparatus includes a mask into which the speechsignal enters and that has a transfer function that affects the speechand wherein the noise model characterizes the transfer function suchthat equalizing the speech signal removes at least a portion of theeffects of the transfer function.
 3. The method of claim 1, wherein thenoise model is represented as a digital filter.
 4. The method of claim3, wherein the digital filter is a linear predictive coding (LPC) filterbased on a set of LPC coefficients that are generated from a set ofautocorrelation coefficients.
 5. The method of claim 4, wherein the LPCcoefficients are generated from the set of autocorrelation coefficientsusing a Durbin method.
 6. The method of claim 4, wherein the step ofgenerating the inhalation noise model comprises the steps of: samplingthe inhalation noise to generate at least one digitized sample of theinhalation noise; determining the set of autocorrelation coefficientsfrom the at least one digitized sample; generating the set of LPCcoefficients based on the set of autocorrelation coefficients; andgenerating the LPC filter from the set of LPC coefficients.
 7. Themethod of claim 6 further comprising the step of windowing the at leastone digitized sample prior to determining the set of autocorrelationcoefficients.
 8. The method of claim 7, wherein the step of windowing isperformed using a Hamming window.
 9. The method of claim 4, wherein thespeech signal is equalized using the inverse of the LPC filter.
 10. Themethod of claim 1 further comprising the step of post-filtering theequalized speech signal.
 11. The method of claim 1, wherein the inputsignal further includes inhalation noise, the method further comprisingthe step of updating the noise model.
 12. The method of claim 11,wherein the step of updating the noise model comprises the steps of:comparing the input signal to the noise model to obtain a similaritymeasure; and comparing the similarity measure to at least one thresholdto detect the inhalation noise in the input signal, wherein the noisemodel is updated based on the detected inhalation noise.
 13. The methodof claim 12, wherein the noise model is a linear predictive coding (LPC)filter based on a set of LPC coefficients that are generated based on afirst set of autocorrelation coefficients, the step of updating thenoise model further comprising the steps of: sampling the detectedinhalation noise to generate at least one digitized sample of thedetected inhalation noise; determining a second set of autocorrelationcoefficients from the at least one digitized sample; updating the firstset of autocorrelation coefficients as a function of the first andsecond sets of autocorrelation coefficients; updating. the set of LPCcoefficients based on the updated set of autocorrelation coefficients;and updating the LPC filter based on the updated set of LPCcoefficients.
 14. A device for equalizing a speech signal in apressurized air delivery system, comprising: a processing element; and amemory element coupled to the processing element for storing a computerprogram for instructing the processing device to perform the steps of:generating an inhalation noise model based on inhalation noise;receiving an input signal that includes a speech signal; and equalizingthe speech signal based on the noise model.
 15. The device of claim 14,wherein the processing element is a digital signal processor.
 16. Asystem for equalizing a speech signal comprising: a pressurized airdelivery system; and a communication system coupled to the pressurizedair delivery system, the system comprising: a processing element; and amemory element coupled to the processing element for storing a computerprogram for instructing the processing device to perform the steps of:generating an inhalation noise model based on inhalation noise;receiving an input signal that includes a speech signal; and equalizingthe speech signal based on the noise model.